Tim Kraska
This article contains paid contributions. It may require cleanup to comply with Wikipedia's content policies, particularly neutral point of view. |
Tim Kraska | |
---|---|
Born | |
Nationality | German |
Alma mater | ETH Zurich (PhD) University of Münster (MSc)(BS) University of Sydney (MSc) |
Known for | Learned indexes |
Awards | VLDB early career (2018) NSF Career Award (2015) Intel Outstanding Researcher Award (2021) |
Scientific career | |
Fields | Computer Science |
Institutions | Massachusetts Institute of Technology Brown University |
Website | people |
Tim Kraska is a German computer scientist specializing in data systems and the intersection of systems and machine learning. He is currently an associate professor of computer science at the Massachusetts Institute of Technology.[1]
Education
[edit]Kraska received his PhD from the Swiss Federal Institute of Technology in Zürich in 2010,[2] his Master's of Science degrees from University of Münster in Germany and University of Sydney in 2006,[2] and a Bachelor of Science in Information Systems also from University of Münster in 2004.[2]
Career
[edit]Kraska was at the University of California-Berkeley's AMPLab as a post-doctoral scholar from 2010 to 2012.[3] Kraska joined Brown University's computer science department as an assistant professor in January 2013.[2] During this time, his focus was on big data management and hybrid human/machine data base systems.[2] He was later promoted to adjunct professor in January 2018.
Kraska co-founded Einblick Analytics, a startup based on the Northstar research project, which builds a collaborative data platform to enable teams to work together.[4][5][6]
Kraska has been involved in the development of numerous data systems, such as building a database on S3,[7] which proposed for a first time the separation of compute and storage for cloud database systems as now used by Snowflake and many other systems; Tupleware, a compilation framework for data analytic workflows;[8] CrowdDB, a database system that automatically uses crowd-sourcing for data cleaning tasks;[9] and Northstar, an interactive data science system.[10] Kraska co-founded Einblick Analytics in 2010, a startup based on Northstar, which builds a collaborative data platform to enable teams to work together.[4][5][6]
Kraska developed the concept of Learned Indexes, which he developed while at Google.[11] It is now used as a part of Google BigTable,[12] has been integrated into RocksDB,[13] and has been used in other applications such as DNA sequence alignment[14] and internet packet classification.[15]
Kraska has also developed the first Instance Optimized Database Systems.[16]
Kraska has published more than 150 scholarly articles, has been cited more than 8,500 times and has an h-index of 43.[17]
Awards
[edit]Kraska received the Siemens Prize and the Master of Information Technology Scholarship for outstanding achievement from the University of Sydney, both in 2005.[2] Kraska received a German Academic Exchange Service scholarship in 2006.[2] Kraska received a Swiss National Science Foundation Prospective Researcher Fellowship in 2010.[2] Kraska received the VLDB best demo award in 2011[2] and the VLDB Early Career Award in 2018.[18]
Kraska received the NSF Career Award in 2015,[19][20] the Google Faculty Research Award in 2015 for his proposal “Human-In-the-Loop Data Exploration”,[21] the VMware Systems Research Award in 2017,[22] and the Intel Outstanding Researcher Award in 2021.[23]
Kraska was the PC Track Chair for the 2016 SIGMOD conference,[24] and he was the Program Vice Chair for the 2019 conference.[25] Kraska was awarded the Sloan Research Fellowship for computer science in 2017.[26][17]
Kraska is on the advisory board of the Northeast Big Data Innovation Hub.[27]
Scientific Publications
[edit]- Amber Feng, Michael Franklin, Donald Kossmann, Tim Kraska, Samuel Madden, Sukriti Ramesh, Reynold Xin, “CrowdDB: Sourcing the VLDB Crowd” (Demo Paper), Proceedings of the VLDB Endowment (PVLDB), 4(12), pp. 1387–1390, 2011.[9]
- Jiannan Wang, Tim Kraska, Michael J. Franklin, Jianhua Feng, “CrowdER: Crowdsourcing Entity Resolution,” Proceedings of the VLDB Endowment (PVLDB), 5(11), pp. 1483–1494, 2012.[28]
- Matthias Brantner, David Graf, Daniela Florescu, Donald Kossmann, Tim Kraska, “Building a Database on S3,” Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 251–264, 2008.[7]
- Tim Kraska, Ameet Talwalkar, John Duchi, Rean Griffith, Michael Franklin, Michael Jordan, “MLbase: A Distributed Machine-learning System,” Proceedings of the Conference on Innovative Data Systems Research (CIDR), 2013.[29]
- Tim Kraska, Alex Beutel, Ed H. Chi, Jeffrey Dean, Neoklis Polyzotis, “The Case for Learned Index Structures,” Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 489–504, 2018.[11]
- Donald Kossmann, Tim Kraska, Simon Loesing, “An Evaluation of Alternative Architectures for Transaction Processing in the Cloud,” Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 579–590, 2010.[30]
- Tim Kraska, Martin Hentschel, Gustavo Alonso, Donald Kossmann, “Consistency Rationing in the Cloud: Pay only when it matters,” Proceedings of the VLDB Endowment (PVLDB), 2(1), pp. 253–264, 2009.[31]
- Tim Kraska, Gene Pang, Michael Franklin, Samuel Madden, Alan Fekete, “MDCC: Multi-Data Center Consistency,” Proceedings of the Eurosys Conference, pp. 113–126, 2013.[32]
- Carsten Binnig, Donald Kossmann, Tim Kraska, Simon Loesing, “How is the Weather tomorrow? Towards a Benchmark for the Cloud,” DBTest Workshop in conjunction with SIGMOD 2009, Providence, RI, 2009.[33]
- Jiannan Wang, Guoliang Li, Tim Kraska, Michael J. Franklin, Jianhua Feng, “Leveraging Transitive Relations for Crowdsourced Joins,” Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 229–240, 2013.[34]
Patents
[edit]- “Fine-grained and concurrent access to a virtualized disk in a distributed system,” US application no. 12/350,197, July 2009.[35]
- “Visual data computing platform using a progressive computation engine,” provisional application, July 2021. Zeyuan Shang, Emanuel Zgraggen, Benedetto Buratti, Philipp Eichmann, Navid Karimeddiny, Charlie Meyer, Wesley Runnels, Tim Kraska.
References
[edit]- ^ "Tim Kraska, Systems for ML, ML for Systems". Massachusetts Institute of Technology. Retrieved March 7, 2022.
- ^ a b c d e f g h i Amy Tarbox (August 16, 2012). "Michael Littman Returns to Brown with Professor Appointment; Tim Kraska and Paul Valiant to Join the Department as Assistant Professors". Brown University. Archived from the original on January 3, 2022. Retrieved March 7, 2022.
- ^ Mark Nickel. "Tim Kraska". Brown University. Retrieved March 7, 2022.
- ^ a b Ben Lorica (December 9, 2020). "A new startup from MIT and Brown lets users transform, visualize, and model data through a graphical user interface". Retrieved March 7, 2022.
- ^ a b Alex Woodie (September 20, 2021). "An Interactive Analytics Whiteboard for COVID Times". Retrieved March 7, 2022.
- ^ a b Rob Matheson (June 27, 2019). "Drag-and-drop data analytics". Cambridge, MA: MIT. Retrieved March 7, 2022.
- ^ a b Matthias Brantner, Daniela Florescu, David Graf, Donald Kossmann, Tim Kraska (June 9, 2008). "Building a database on S3". Proceedings of the 2008 ACM SIGMOD international conference on Management of data. pp. 251–264. doi:10.1145/1376616.1376645. ISBN 9781605581026. S2CID 8017949. Retrieved March 7, 2022.
{{cite book}}
: CS1 maint: multiple names: authors list (link) - ^ Andrew Crotty, Alex Galakatos, Kayhan Dursun, Tim Kraska, Carsten Binnig, Ugur Cetintemel, Stan Zdonik (August 1, 2015). "An architecture for compiling UDF-centric workflows". Proceedings of the VLDB Endowment. 8 (12): 1466–1477. doi:10.14778/2824032.2824045. Retrieved March 7, 2022.
{{cite journal}}
: CS1 maint: multiple names: authors list (link) - ^ a b Amber Feng, Michael Franklin, Donald Kossmann, Tim Kraska, Samuel Madden, Sukriti Ramesh, Andrew Wang, Reynold Xin. "CrowdDB: Query Processing with the VLDB Crowd" (PDF). VLDB. Retrieved March 7, 2022.
{{cite web}}
: CS1 maint: multiple names: authors list (link) - ^ Tim Kraska (2018). "Northstar". Proceedings of the VLDB Endowment. 11 (12): 2150–2164. doi:10.14778/3229863.3240493. hdl:1721.1/132273. S2CID 240346112. Retrieved March 7, 2022.
- ^ a b Tim Kraska, Alex Beutel, Ed H. Chi, Jeff Dean, Neoklis Polyzotis. "The Case for Learned Index Structures". Google Research. Retrieved March 7, 2022.
{{cite web}}
: CS1 maint: multiple names: authors list (link) - ^ Hussam Abu-Libdeh, Deniz Altmbüken, Alex Beutel, Ed H. Chi, Lyric Doshi, Tim Kraska, Xiaozhou (Steve) Li, Andy Ly, Christopher Olston (2020). "Learned Indexes for a Google-scale Disk-based Database" (PDF). Vancouver. Retrieved March 7, 2022.
{{cite web}}
: CS1 maint: multiple names: authors list (link) - ^ Yifan Dai, Yien Xu, Aishwarya Ganesan, Ramnatthan Alagappan, Brian Kroth, Andrea Arpaci-Dusseau and Remzi Arpaci-Dusseau. "From WiscKey to Bourbon: A Learned Index for Log-Structured Merge Trees" (PDF). University of Wisconsin-Madison. Retrieved March 7, 2022.
{{cite web}}
: CS1 maint: multiple names: authors list (link) - ^ Saurabh Kalikar, Chirag Jain, Md Vasimuddin, Sanchit Misra (February 28, 2022). "Accelerating minimap2 for long-read sequencing applications on modern CPUs". Nature Computational Science. 2 (2): 78–83. doi:10.1038/s43588-022-00201-8. PMID 38177520. S2CID 247186356. Retrieved March 7, 2022.
{{cite journal}}
: CS1 maint: multiple names: authors list (link) - ^ Alon Rashelbach, Ori Rottenstreich, Mark Silberstein (July 30, 2020). "A Computational Approach to Packet Classification". Proceedings of the Annual conference of the ACM Special Interest Group on Data Communication on the applications, technologies, architectures, and protocols for computer communication. pp. 542–556. arXiv:2002.07584. doi:10.1145/3387514.3405886. ISBN 9781450379557. S2CID 211146132. Retrieved March 7, 2022.
{{cite book}}
: CS1 maint: multiple names: authors list (link) - ^ Tim Kraska, Mohammad Alizadeh, Alex Beutel, Ed H. Chi, Jialin Ding, Ani Kristo, Guillaume Leclerc, Samuel Madden, Hongzi Mao, Vikram Nathan (2019). "SageDB: A Learned Database System" (PDF). Retrieved March 7, 2022.
{{cite web}}
: CS1 maint: multiple names: authors list (link) - ^ a b "Tim Kraska: H-index & Awards - Academic Profile". research.com. Retrieved March 7, 2022. As of March 7, 2022, the exact figures are: "H-index: 43; Citations: 8,841; Publications: 155; World Ranking: 4009; National Ranking (United States): 2037".
- ^ "VLDB Early Career Award". VLDB Endowment. Retrieved March 7, 2022.
- ^ "NSF Award Abstract # 1453171 CAREER: Query Compilation Techniques for Complex Analytics on Enterprise Clusters". National Science Foundation. June 9, 2015. Retrieved March 7, 2022.
- ^ Jesse Polhemus (June 17, 2015). "Tim Kraska Wins NSF CAREER and AFOSR Young Investigator Awards". Brown University. Retrieved March 7, 2022.
- ^ Jesse Polhemus (August 28, 2015). "Tim Kraska, Andy van Dam, And Carsten Binnig Win A Google Faculty Research Award". Brown University. Retrieved March 7, 2022.
- ^ "VMware Systems Research Award". VMware. Retrieved March 7, 2022.
- ^ "Intel's 2021 Outstanding Researcher Awards Recognize 17 Academic Achievers". Intel. 2021. Retrieved March 7, 2022.
- ^ "The 2016 ACM SIGMOD/PODS Conference: San Francisco, USA - Organization: SIGMOD Program Committee". 2016. Retrieved March 7, 2022.
- ^ "Organization: Conference Chairs". 2019. Retrieved March 7, 2022.
- ^ "Alfred P. Sloan Research Fellowships 2017" (PDF). Alfred P. Sloan Foundation. February 21, 2017. Retrieved March 7, 2022.
- ^ "Leadership Team". Northeast Big Data Innovation Hub. 11 May 2016. Retrieved March 7, 2022.
- ^ Jiannan Wang, Tim Kraska, Michael J. Franklin, Jianhua Feng (2012). "CrowdER: Crowdsourcing Entity Resolution" (PDF). VLDB. Retrieved March 7, 2022.
{{cite web}}
: CS1 maint: multiple names: authors list (link) - ^ Tim Kraska, Ameet Talwalkar, John Duchi, Rean Griffith, Michael Franklin, Michael Jordan (2013). "MLbase: A Distributed Machine-learning System" (PDF). Retrieved March 7, 2022.
{{cite web}}
: CS1 maint: multiple names: authors list (link) - ^ Donald Kossmann, Tim Kraska, Simon Loesing (2010). "An Evaluation of Alternative Architectures for Transaction Processing in the Cloud" (PDF). Brown University. Retrieved March 7, 2022.
{{cite web}}
: CS1 maint: multiple names: authors list (link) - ^ Tim Kraska, Martin Hentschel, Gustavo Alonso, Donald Kossmann (2009). "Consistency Rationing in the Cloud: Pay only when it matters" (PDF). Retrieved March 7, 2022.
{{cite web}}
: CS1 maint: multiple names: authors list (link) - ^ Tim Kraska, Gene Pang, Michael Franklin, Samuel Madden, Alan Fekete (April 15, 2013). "MDCC: Multi-Data Center Consistency" (PDF). Prague, Czechia: Massachusetts Institute of Technology. Retrieved March 7, 2022.
{{cite web}}
: CS1 maint: multiple names: authors list (link) - ^ Carsten Binnig, Donald Kossmann, Tim Kraska, Simon Loesing (2009). "How is the Weather tomorrow? Towards a Benchmark for the Cloud" (PDF). Brown University. Retrieved March 7, 2022.
{{cite web}}
: CS1 maint: multiple names: authors list (link) - ^ Jiannan Wang, Guoliang Li, Tim Kraska, Michael J. Franklin, Jianhua Feng (June 22, 2013). "Leveraging transitive relations for crowdsourced joins". Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data. pp. 229–240. arXiv:1408.6916. doi:10.1145/2463676.2465280. ISBN 9781450320375. S2CID 8830515. Retrieved March 7, 2022.
{{cite book}}
: CS1 maint: multiple names: authors list (link) - ^ US 2009177658A1, Mathias Brantner, David Graf, Donald Kossmann, Tim Kraska, "Fine-Grained and Concurrent Access to a Virtualized Disk in a Distributed System", issued 2009-07-09